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SOME  LOCALLY  OPTIMAL  SUBSET  SELECTION  RULES* 

By  Deng-Yuan  Huang  and  S.  Panchapakesan 
Academia  Sinica,  Taipei  and  Southern  Illinois  University 

1.  Introduction.  Let  n0,u^ ....  .u^  be  k  +  1  independent  populations 
where  u.  has  the  associated  density  function  f(x,e^)  with  the  unknown  para¬ 
meter  0.  belonging  to  an  interval  ©  of  the  real  line.  In  this  paper,  we 
are  concerned  with  two  types  of  goals.  The  first  is  to  select  from  u,,..., 
7t those  populations,  if  any,  that  are  better  (to  be  defined)  than  nQ  which 
is  the  control  or  standard  population.  The  other  is  to  select  a  subset 
(preferably  small  in  size)  of  the  k  populations  that  will  contain 

the  best  (suitably  defined)  among  them.  In  the  recent  years,  the  attention 
has  increasingly  been  focussed  on  the  construction  of  optimal  selection 
rules.  Some  of  the  important  papers  in  this  direction  are  Bickel  and  Yahav 
[1],  Chernoff  and  Yahav  [2],  Goel  and  Rubin  [4],  Gupta  and  Hsu  [5],  Gupta 
and  Huang  [6,7],  and  Gupta  and  Kim  [10].  These  investigations  deal  with 
the  symmetric  case  which  implies  equal  sample  sizes.  There  have  been  some 
investigations  in  the  unequal  sample  sizes  case  but  these  relate  to  ad  hoc 
and  heuristic  procedures  and  are  not  generally  successful  in  establishing 
the  least  favorable  configuration  (LFC)  for  the  probability  of  a  correct 
decision.  For  many  classical  procedures  in  the  literature  for  selecting 
a  subset  containing  the  best,  the  LFC  turns  out  to  be  8^  =  ...  =  o^.  This 
provides  the  motivation  for  seeking  selection  rules  which  are  optimal  in  a 
suitable  sense  in  a  neighborhood  of  every  equi -parameter  point.  When  com¬ 
paring  these  populations  with  ttq,  the  local  optimality  concerns  with  the 

*This  research  was  supported  by  the  Office  of  Naval  Research  Contract 
N00014-75-C-0455  at  Purdue  University.  Reproduction  in  whole  or  in  part  is 
permitted  for  any  purpose  of  the  United  States  Government. 
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ability  of  the  rule  to  reject  any  population  that  is  inferior  to  rr  ,  and 
to  choose  a  population  which  is  superior  while  all  others  are  inferior. 

Section  2  deals  with  selecting  populations  that  are  superior  to  a  con¬ 
trol  and  as  the  applications  indicate  we  can  handle  unequal  sample  sizes. 

The  next  section  discusses  the  usual  goal  of  selecting  a  subset  containing 
the  best  and  locally  optimal  rules  based  on  ranks.  Though  the  density  func¬ 
tion  f(x,e)  is  assumed  to  be  known  but  for  the  value  of  0,  a  rule  based  on 
ranks  is  justified  on  the  basis  of  usual  robustness  considerations  in  that 
the  ranks  are  insensitive  to  outliers  and  there  could  be  possible  deviations 
from  the  model . 

For  detailed  discussions  on  multiple  decision  problems  and  general  de¬ 
cision  theoretic  approach,  the  reader  is  referred  to  Gupta  and  Huang  [8] 
and  Gupta  and  Panchapakesan  [12]. 

2.  Comparison  with  a  control.  We  want  to  construct  a  rule  to  select 
all  populations  that  are  better  than  the  control.  Let  { X -  - } ,  j  =  l,...,n-, 

1 J  I 

denote  the  sample  observations  from  ^ ,  i  =  0,1,..., k.  The  selection  rule 

will  depend  upon  the  observations  through  the  statistics  T^0,  i  =  l,...,k, 

where  T..  is  suitably  defined  to  indicate  the  difference  between  tt  •  and  tt.. 

1 J  *  J 

For  fixed  n_,...,n.,  we  assume  that  T . *  has  a  density  function  g  (t.f)  de- 
0  k  U  Tij.  1J 

pending  on  the  parameter  t  . . .  Usually  the  T. .  are  invariantly  sufficient 

'  J  *  J 

for  t...  We  use  the  parameter  t--  as  our  measure  of  'distance'  between  tt. 

'  J  I J  1 

and  tt  j .  Now,  is  the  same  for  all  i  and  this  common  value  is  denoted  by 
r*.  In  the  case  of  a  location  parameter  0,  we  may  have  t..  =  |e.-  e.|  and 

1 J  1  J 

in  this  case,  t*  =  0.  On  the  other  hand,  if  e  is  a  scale  parameter,  we  can 

/0i  e.  \ 

define  =  max  (—  .  g  j.  which  gives  t*  =  1 .  We  define  population 
J 

to  be  superior  if  >  t*  and  inferior  otherwise. 


3 


Let  a  =  {t|t=  (Tl0....,Tk0)  ,  fli0  =  Lt I Ti0 <. T*-}  and  - 
{t  I Tj0  =  T*  <  Tio,j=l }.  In  other  words ,  i2iQ  is  the  set  of  configura¬ 
tions  for  which  is  inferior  and  q*q  is  the  set  of  configurations  for 
which  is  superior  and  all  other  populations  are  'just  inferior'. 


2.1.  Optimality  requirement.  Let  0  <  Y  <  1  be  given.  We  wish  to 
derive  a  rule  which  satisfies 

(2.1)  sup  P  [it,-  is  selected]  <  y  for  i  =  l,...,k,  and 

“io  : 


(2.2) 


maximizes  £ 


1=1  3tio  - 


[tt1-  is  selected | 


T  •  =T* 

10 


among  all  rules  satisfying  (2.1).  We  note  that  the  condition  (2.1)  con¬ 
trols  the  error  probabilities  and  the  condition  (2.2)  maximizes  the  ef¬ 
ficiency  in  a  certain  sense  of  the  rule  in  picking  out  the  superior  popu¬ 
lations  in  a  neighborhood  of  the  configuration  t  =  (t*,...,t*)- 

Let  h  (t] . tk )  denote  the  joint  density  of  T1q . Tko  with  respect 

to  a  a-finite  measure  p.  A  selection  rule  is  given  by  6(t) 

=  {«i (t),. . . ,6k(t)},  where  fi^t)  is  the  probability  of  selecting  ^  when 
t  =  (t1,...,tk)  is  observed.  Further,  let  h^*(t)  denote  the  density  h^(t) 


when  t 

3 

io 


=  (i 


,T*),  and  let  h^(t)  denote  the  partial  derivative 


h  (t)  evaluated  at  t  =  (t*,...,t*).  Finally,  we  need  to  assume  cer¬ 


tain  regularity  conditions,  namely,  that  h  (t)  is  continuously  differenti¬ 


able  with  respect  to  each  component  of  t  and 


3t 


h  (t)  is  integrable. 


10 


Under  these  regularity  conditions,  it  is  easy  to  see  that 


~ —  PT[n.j  is  selected|fi*0] 


io  - 


Mo=T* 


where  J  is  the  sample  space  of  (T-|0,...  ’Tko>' 


i 
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Summarizing  the  above  discussion,  we  are  seeking  a  rule  6(t)  which 
satisfies 

(2.3)  /  6-(t)hT(t)dp(t)  <  Y  for  i  = 

and  maximizes,  among  all  rules  satisfying  (2.3),  the  expression 

k  / . , 

(2.4)  l  /  6  .(t)h^(t)du(t). 

i=l  j  1  "  T 

2.2.  A  locally  optimal  rule.  We  now  state  and  prove  the  main  theorem 
of  this  section. 

Theorem  2.1.  Under  all  the  assumptions  stated  previously,  a  rule 
6°(t)  which  satisfies  (2.3)  and  maximizes  (2.4)  among  all  rules  satsifying 
(2.3)  is  given  by 

II  > 

x  if  h^(t)  =  c.hT*(t) 

0  < 

where  0  <  \  <  1  and  c^  are  determined  by 

(2.6)  /  5?(t)h  *(t)dy(t)  =  y,  i  =  1 , - . - ,k. 

J  T 

Proof.  The  proof  is  straightforward  by  noting  that  for  any  rule  6 
satisfying  (2.3)  we  have 
k 

(2. 7)  /  [«1(t)-6?(t)][hJ^(t)-C.hT*(t)]dy(t)  <0. 

2.3  Applications  to  special  cases.  We  consider  the  following  cases: 
(A)  normal  means  comparison;  corrmon  known  variance  and  unequal  sample  sizes, 
(B)  normal  means  comparison;  common  unknown  variance  and  unequal  sample 


L. 
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sizes,  and  (C)  gamma  scale  parameters  comparison;  unequal  degrees  of  free¬ 
dom. 


Case  A:  Normal  means  comparison;  common  known  variance  and  unequal 
sample  sizes.  Let  ^  be  N(e.j,a  )  and  ^  be  the  sample  mean  based  on  ni 
independent  observations  from  ,  i  =  0,1,..., k.  We  take  Tio  =  -  XQ 

and  =  e,-  -  a..  Here  t*  =  0.  The  joint  density  h  (t,,...,t.)  of 
Tl0,...,T^  is  that  of  a  multivariate  normal  distribution  with  mean  vector 
t  and  covariance  matrix  z  given  by 


where  a-  =  n0/n^»  i  =  1 


,k.  It  is  easy  to  verify  that 


hjl’ftl/h  ,<t>  -  l  t/L 

T  T  j  =  l  J 

where  r”1  =  (a1'-’). 


In  the  particular  case  in  which  nQ  t  n^  =  ...  =  n^,  we  have  a-j  =  ... 
=  ak  =  a  (say).  In  this  case,  it  is  easy  to  evaluate  The  rule  6° 
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(2.9)  P  {Z. - l  Z->c  }  =  y » 

T  1  k+a  j=l  J  1 

2  2 

Z, ,. . . ,Z.  being  normal  variabl es  with  E(Z. )  =  0,  V(Z. )  =  ~~  (1  +  <*  )»  and 

i  K  1  1  71 

2  0 
E(Z.Z^.)  =  .  The  left  hand  side  of  (2.9)  is  equal  to 

P*{(k+a2-l)Z.  -  l  1.  >  c.(k+a2)}  =  1  -  $(c.(k+«2)/n), 

T  jf i  J  1 

where  $(•)  is  the  standard  normal  cdf  and 

2 

n2  =  ~  [(1  +  a2) (k  +  a2  -  1)2+  (1  +  a2)(k-  1)-  2(k-  l)(k+l-a2)  +  (k  -  l)(k  -  2)]. 
o 

Thus, 

(2.10)  c,  =  -a7*"1(l-Y). 

k+T 

Remark  2.1.  The  rule  6°  given  by  (2.8)  is  of  the  form:  select  ^ 

if  and  only  if  t.  _>  a  t-U-  7  t.  -  c,  where  c  >  0  and  a  <  1.  If,  however, 

K"  jr i  J 

we  take  a  =  1,  then  we  have  the  so-called  average-type  rule. 


Case  B:  Normal  means  comparison;  common  unknown  variance  and  unequal 

sample  sizes.  As  in  Case  A,  X.  is  the  sample  mean  based  on  ni  observations 

2  2 
from  Let  s  be  the  usual  pooled  unbiased  estimator  of  a  based  on 

v  =  (k+l)(n-l)  degrees  of  freedom.  Define  Zi  =  (X^  -  X  )/  AJ-+  and 

/  ni  no 

Tio  =  ZiZs*  1  =  Let  =  (p,j)  vvhere  =  Corr(Zi,Zj).  Then 

2  2  -1  2 

=  [(1  +  )(1  +  aj)]  2  where  =  nQ/n..  Let  A  =  (a^j)  be  the  inverse 
of  n.  Then  the  joint  density  h  (t^ ....  ,tk)  of  the  TiQ  is  the  multivariate 
t-density  (see  Dunnett  and  Sobel  [3])  given  by 


7 


where  tio  =  e.  -  e0,  i  =  1 . .  ,k. 

Now,  it  is  easy  to  verify  that 

*<:><{)  vtk 

h~  TTT~  =  ~  - 1 — ^ -  =  ^(t;v,k,{a • . }) ,  say. 

v  1+  1  F  a.  .t.t .  1J 

v  -L-  1J  1  J 


1 


Thus  the  rule  6°  is  given  by 

(2.12)  6°(t) 


1  if  t(t;v,k,{ai ^})  >_  c 


0  otherwise. 


In  the  particular  case  in  which  nQ  f  *  n^  8  ...  =  n^,  we  get 

p 

p i j  =  1/(1  + a  );  where  =  ...  =  ak  =  a.  This  gives 


(1  +a  ) (k-l+a ) 
a2(k+a2) 


aij 


1+a 


a(k+a2) 


1  =  J 


i  t  j  . 


The  constant  c^  is  given  by 

(2.13)  PT *[^  ( T-j q » •  •  •  ,T^g ; v , k , (a .j j } )  ^ c.j ]  =  y. 

It  can  be  easily  seen  that  (2.13)  can  be  rewritten  as 

(2.14)  Ptdil'AY- AJY+ vd^O]  *  Y. 

where  di  =  ci/(v  +  k),  A'  =  [A^  ...  A^j.and  Y'  =  (Y-j . . .  Yk )  has  the  multi¬ 
variate  t  density  given  by  (2.11)  with  T^0  =  0,  i  =  l,...,k.  The  left 
hand  side  of  (2.14)  is  equal  to 

P[di  (Y  -  2J7  A" ' 1  Ai  )*  A(Y  -  A' ' 1A. )  +  vd.  -  A’A^A.  <  0] 

2 

=  ptdi  (? -  sir  A_lAi y  A(?‘  A_lAi)  +  $2vdi  -  4 ar  AiA'lAi  <  o] 


where  Z  -  N(0,o  sj). 
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/v  S 
o 


— —  A-1  A  • ) 1  A(—  Z-^S 

2d./v  1  0-0 


1 _ 

2di  /v 


A'^A. ) 


+ 


1 

4di  v 


AlA'V  <  0]. 


2 

In  order  to  evaluate  the  above  probability,  we  note  that  U  =  ~  x  and  that, 

a 

2 

conditioned  on  U  =  u,  the  quadratic  form  has  a  non-central  x  distri'  ution 

on  k  degrees  of  freedom  with  the  noncentrality  parameter  x  =  x(u,d^) 

=  — AlA'^A. .  Thus  the  constant  c.  is  given  by 
4vd. 


(2.15)  ^  P[di(Vi  +  u  -  x(u,di )}  <  0]gv(u)du  =  Y. 

2 

where  V..  has  a  noncentral  x  distribution  on  k  degrees  of  freedom  with  non- 

2 

centrality  parameter  x(u,d..),  and  9y(u)  is  the  central  x  density  on  v 
degrees  of  freedom.  Tables  for  the  constant  di  are  expected  to  be  part  of 
a  future  paper. 

Case  C:  Gamma  scale  parameters  comparison;  unequal  shape  parameters. 
Let  iri  (i  =  0,1, ...,k)  be  a  gamma  population  with  the  density 

xv> 

(2.16)  f(x;9i,vi)  =  - —  expt-x/e^ },  x  >  0,  e  >  0. 

We  take  =  0^/9Q  so  that  t*  =  1 .  Let  X^,...,X^  be  independent  observa¬ 
tions  from  these  populations.  Define  TiQ  =  Xi/XQ,  i  =  l,...,k.  The  joint 
density  of  T.|0,. . .  .T^  is  easily  derived  to  be 


hT(t)  = 


k 

n 

1=0 


r(m) 


k  v.-l 

n  t.1 
i=l  1 


r(v,)- 


10 


k 

1+  l 

i=l 


t^m 

rio 


(2.17) 
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where  m  =  v  +  ...  +  v..  From  this,  we  get 


(2.18) 


mt.j 


TT(tT  i+t,+...+t. 


Thus  the  rule  6°  is  given  by 


1  mti  1  (vi  +  ci )  (1  +  ti  +  ...  +  tk) 


(2.19) 


6?(t)  = 


0  otherwise. 


The  constant  is  given  by 
(2.20)  P  * 


mXi  \ 

x  +.  .T+X  ^  vi  +  ci  =  Y 


where  X  ,...,X.  are  independent  standardized  (0=1)  gamma  variables  with 

0  K 

degrees  of  freedom  v  ,  respectively.  It  is  known  that  X ./ (X  +. .  ,+X,  ) 

OK  I  0  K 

has  a  beta  distribution  with  parameters  and  m  -  denoted  by 
B(v^,m-v1- )•  Thus  (v^  +  c^  )m"^  is  the  upper  100  y%  point  of  B(v^,m-v1- )  dis¬ 
tribution.  So  c.j  can  be  obtained  from  the  tables  of  the  incomplete  beta 
function. 

It  should  be  pointed  out  that  the  above  problem  includes  that  of  comparing 
normal  variances  based  on  unequal  sample  sizes. 


3.  A  locally  strongly  monotone  invariant  subset  selection  rule  based 
on  ranks.  In  this  section,  we  consider  the  goal  of  selecting  a  subset 
containing  the  best  population.  Subset  selection  procedures  based  on  ranks 
have  been  earlier  studied  by  several  authors  notably  among  whom  are  Gupta, 
Huang  and  Nagel  [9],  Gutpa  and  McDonald  [11],  McDonald  [13,14],  and  Nagel 
[15].  A  detailed  account  of  these  procedures  is  available  in  Gupta  and 
Panchapakesan  [12].  Of  these  authors,  Nagel  [15]  and  Gupta,  Huang  and 
Nagel  [9]  have  discussed  locally  optimal  selection  rules  but  using 


different  criteria  from  ours.  The  setup  of  our  investigation  is  analogous 
to  that  of  Gupta,  Huang  and  Nagel  [9].  We  assume  a  parametric  model  but 
seek  procedures  based  on  ranks  for  robustness  reasons  as  explained  in 
Section  1. 

3.1.  Preliminaries.  Let  be  k  independent  populations  where 

tt ^  has  the  associated  distribution  function  F(x,e^)  assumed  to  be  absolutely 
continuous  with  density  f(x,e^)  which  is  known  except  for  the  value  of  the 
parameter  e..  The  parameter  belongs  to  some  interval  on  the  real  line 
containing  the  origin.  It  is  further  assumed  that  the  density  f(x,e)  satis¬ 
fies  the  set  of  conditions  in  (3.3).  The  population  associated  with  the 
largest  is  called  the  best  population.  In  case  of  a  tie,  one  of  the  con¬ 

tenders  is  assumed  to  be  tagged  as  the  best.  Let  n  =  { e  j  e  =  (e-j  ...,0^)1  and 
S2Q  =  (0  1  e -j  =  . . .  =  e^} .  Let  { j } ,  j  =  l,...,n,  be  independent  observations 
from  n.  and  let  R,.  denote  the  rank  of  X . •  in  the  pooled  sample  of  N  =  kn 

I  '  J  *  J 

observations.  The  smallest  observation  has  rank  1  and  the  largest  N. 


Definition  3.1 .  A  rank  configuration  is  an  N-tuple  a  =  (a-j  , . . .  ,A^)  > 
A.j  e  {1 ,2,. . .  ,k},  where  a^  =  j  means  that  the  ith  smallest  observation  in 
the  pooled  sample  comes  from  tt  j . 

Let  c=  {A}  dentoe  the  set  of  all  rank  configurations  for  fixed  k  and 
n.  For  fixed  a,  let  X  =  {x  £  X  j  ax  =  A } ,  where  X  =  {x:  x=  (x^,... ,xN)} 
and  ax  denotes  the  rank  configuration  of  x  =  (xj . ,x^).  The  decision 
space  &  consists  of  all  2  subsets  (including  the  empty  set)  of  the  set 
{1 ,2, . . . ,  k  > .  Any  subset  is  denoted  by  d  so  that  &  =  Jd|dc{l,2,...,k)}. 
Any  decision  d  =  { i i .ig,. • • , i r 1  corresponds  to  selecting  the  subset  of  the 
k  populations  comprising  n-  .  Let  6 (a , d )  denote  the  probability 

71  V 
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that  the  decision  d  is  made  if  the  rank  configuration  A  is  observed. 

Definition  3.2.  A  rank  selection  rule  is  a  measurable  function  de¬ 
fined  on  c  *  &  such  that  for  each  a  E  C  ,  (i )  6(A,d)  _>  0  and 

(ii)  l  6(A,d)  =  1. 
d  £  £r 

Let  pi (a),  i  =  l,...,k,  denote  the  individual  selection  probabilities 

of  the  k  populations,  where 

(3.1)  p,(A)  =  l  6(A,d) 

1  d  3  i 

the  summation  being  over  all  subsets  containing  i.  Let  P  (,x=l,...,k)  de¬ 
al 

note  the  probability  of  including  the  population  it  in  the  selected  subset; 

a 

in  other  words,  P  =  Efp  (a)],  a  =  l,...,k.  Any  decision  d  that  corres- 
ponds  to  the  selection  of  the  best  population  is  called  a  correct  decision. 
We  assume  that  nk  is  the  best  population  and  denote  the  probability  of  a 
correct  decision  by  P (CD ] 6 , A ) .  Clearly,  P (CD | fi , a )  =  E0[pk(A)]. 

Definition  3.3.  A  selection  rule  is  strongly  monotone  if,  for  each 
i  =  l,...,k,P.  increases  in  when  all  other  components  of  e  are  fixed, 
and  P.  decreases  in  e-,  j  f  i,  when  all  other  components  of  e  are  fixed. 

'  J 

Let  6  denote  the  group  of  permutations  g  of  the  integers  (l,2,...,k). 
We  write  g(l,...,k)  =  (gl,...,gk).  Let  h  denote  the  inverse  of  g  and  de¬ 
fine  g(x^ . xk)  =  (xhl ,. . .  ,x^k)  and  gd  =  { i  |  hi  Ed],  dEi  .  Also,  for  any 

A  €  C  ,  iet  g  be  defined  by  gA  =  ( g A i .... >gA^ ) •  Thus  g  is  induced  by  g. 

Let  5  be  the  group  { g } .  Finally,  let  G(i,j  )  =  { g  £  G  |  g  i  =  j}. 

Definition  3.4.  A  selection  rule  5  is  invariant  under  permutation  if 
and  only  if  cs(gA,gd)  =  A  (  a  >  d )  for  all  a  L  C  ,  d  t  a  ,  g  tG  and  g  eG. 

An  invariant  selection  rule  is  completely  specified  by  any  one  of  its 
individual  selection  probabilities.  We  wish  to  derive  an  invariant  rank 


A 
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selection  rule  6  such  that 

(3.2)  inf  P  (CD|s,a)  =  P* 

0€QO  e- 

where  P*(  4-<P*<  1)  is  specified  in  advance  and  it  is  locally  optimal  in 
2k 

the  sense  that  it  is  strongly  monotone  in  a  neighborhood  of  any  point  o0  in 
Obviously,  the  infimum  of  the  probability  of  a  correct  decision  over 
that  neighborhood  is  attained  at  e  . 

3.2.  Derivation  of  a  locally  optimal  rule.  We  assume  that  the  den¬ 
sity  f(x,e)  satisfies  the  following  set  of  conditions: 

(i)  f(x,e)  is  absolutely  continuous  in  0  for  almost  every  x; 

(ii)  the  limit 

(3.3)  f(x,0)  =  lim  1  [f(x,e)-  f(x,0)] 

e-K) 

exists  for  almost  every  x; 

°°  •  “  •  .  . 

(iii)  lim  /  |f(x,e)|dx  =  /  |f(x,0)|dx  <  =»  where  f(x,e)  =  —  f(x,e). 
Q-K)  -co  —  1)0 

Now,  we  note  that  the  distribution  of  the  ranks  does  not  depend  upon 

the  underlying  distributions  of  the  X..  when  Ben  and  thus  P  (CD|6,a)  is 

constant  in  nQ.  Hence  we  can  satisfy  (3.2)  by  choosing  any  point  e0  in 

and  satisfying  PQ  (CD | 6 , A )  =  P*.  With  no  loss  of  generality,  we  assume 

that  eQ  =  (0,...  ,0). 

The  probability  that  the  rank  configuration  a  is  observed  under  e  with 
ei  t  0,  i  =  l,...,k,  is  given  by  (see  Gupta,  Huang  and  Nagel  [9]) 

k 

P0 ( A )  -  AQ  +  l  0iAi(A,e) 


(3.4) 


where 


XN  X2  N 


AQ  =  /  /  •••  /  n  f(xi,0)dxr..dxN, 

-OU  —  n>  -OO  i=l 


-  XN  x2  f(x,,8,)-f{x.,0) 
l  !  I  /  J  o.1  ~  J  — 

1  —CO  —  CO  —  <*>  1 


j-1  N 

n  f(x  )0)  n  f  ( x  ,  o  )dxi • . .dxM,  i  ~  1 , . .  > ,  k . 

e-1  e=j+l  e 


Here  we  define  II  f(x.,0)  •:  1. 
i=l  1 

The  following  lemma  summarizes  some  results  of  Gupta,  Huang  and  Nagel 
[9]  for  an  invariant  rule. 


Lemma  3.1.  For  an  invariant  rank  selection  rule. 


(3.5)  Po(CD|6,a)  =  E0pk(A) 

gJ(k,k)  j, 

There  exists  an  t  >  0  such  that  for  all  o  with  0  <  |o^|  <  t  ,  i  =  1 . 

k,  A.  (A,e)  is  approximately  equal  to  A.  (a)  l  B-,  where 

j:Y' 

oo  XN  X2  •  N 

B  j  =  /  /  ...  /  f(x.  ,0)  i!  f(xe,0)dxr..dxN. 

e^j 

In  this  case, 

(3.6)  P(CD|6,a)  »  j  P^ ( A ) CA  +  Tfryy  l(U-ok)V  +  (kok~U)Ak(A)  1], 

At  C  \  ~  J 


IN  r\  nn 

where  U  =  J  o.  and  V  =  j  A.  (a)  =  )’  B. ,  independent  of  A. 

1  =  1  1  i=l  1  i=l  1 
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Now,  for  deriving  a  locally  strongly  monotone  selection  procedure, 
consider 


(3.7) 


P 

a 


I 

A  €  C 


[ao  +  tftt 


{(U-  o  )V+  (ko  -  U)A  (a) }]p  (A). 

a  a  a  a 


For  any  a  =  l,...,k,  we  get 


(3.8) 


»„<*> 

a  A£  C 


3  Pa 


a€Zc  tftt  {V-VA)}pa(A)’  S^a- 


For  the  strong  monotonicity  property,  we  need  to  define  Pq(a)  such  that 

l  A  (a)p  (a)  >  0  and  £  {V-A  ( a ) }p  (a)  <  0.  This  is  obviously  accomp- 
A€Caa  A€Caa 

lished  by  defining 


(3.9) 


P  (a) 

a 


1, 

P » 

\  0, 


A  (a)  >  V  +  D 

a 

A  (a)  =  V  +  D 

a 

A  (a)  <  V  +  D 

a 


where  D  and  p  are  to  be  chosen  such  that 

(3.10)  V  +  D  >  0  and  P0  [AaU)>V+D]  +  pPo  [A  (a)  =  V  +  D]  =  P*. 

-o  -o  3 

Summarizing  the  above  discussion,  we  obtain  the  following  theorem. 


Theorem  3.1 .  Let  f(x,e)  satisfy  the  conditions  in  (3.3).  Then  the 
invariant  subset  selection  rule  based  on  ranks  defined  by  (3.9)  is  strongly 
monotone  in  a  neighborhood  of  any  point  e  e n  and  inf  P  (CD J 6 ,o )  =  P* 

provided  constants  P  and  D  satisfying  (3.10)  exist. 

Remarks.  (1)  It  is  possible  that  D  and  p  satisfying  (3.10)  may  not 
exist.  In  such  a  case,  the  rule  selects  the  empty  subset.  (Z)  When  D  and 


i 


p  satisfying  (3.10)  exist,  the  rule  (3.9)  is  also  locally  optimal  in  the 
sense  it  has  the  maximum  pQ (CD 1 6,  a) in  a  neighborhood  of  oo  among  all  in¬ 
variant  subset  selection  rules  based  on  ranks  satisfying  inf  pfl(CD|>$,A) 

e-tl2o  - 

This  follows  from  the  theorem  of  Gupta,  Huang  and  Nagel  [9].  (3)  Rules  of 

the  type  (3.9)  are  shown  to  be  "just"  by  Nagel  [15]  provided  that  B.  is  non¬ 
decreasing  in  i,  which  is  true  for  location  parameters  if  and  only  if  f(x) 
is  strongly  unimodal  (i.e.,  -log  f(x)  is  concave).  Nagel  [15]  has  also 

shwon  that  for  a  just  rule  inf  p  (CD(6,a)  =  inf  p  (CD|6,a). 

oen-  e  €  n 


3.3.  A  special  case.  If  f(x,e)  is  the  logistic  density,  i.e., 
f(x,e)  =  e~(x“e V[1  +  e~(x‘0)]2,  then  we  get  =  r  +  is,  where  the  actual 
values  of  r  and  s  >  0  are  immaterial.  The  rule  becomes 


(3.11)  Pa(A) 

where  p  and  C  satisfy 


1 

p 

0 


> 


n 


< 


V  j,  j,  lvc,“p*- 

The  above  rule  is  a  randomized  version  of  the  so-called  R3  type  rule  of 
Gupta  and  McDonald  [11]  who  proposed  it  on  an  ad  hoc  basis.  It  should 
also  be  pointed  out  that  the  rule  is  'just'  in  the  sense  of  Nagel  [15] 
which,  as  pointed  out  earlier,  implies  that  the  infimum  of  the  probability 
of  a  correct  decision  over  n  is  P*. 


16 


REFERENCES 


[1]  Bickel,  P.  J.,  and  Yahav,  J.  A.  (1977).  On  selecting  a  subset  of 
good  populations.  Statistical  Decision  Theory  and  Related  Topics  -II 
(Eds.  S.  S.  Gupta  and  D.  S.  Moore),  Academic  Press,  New  York,  pp  37-55 

[2]  Chernoff,  H. ,  and  Yahav,  J.  A.  (1977).  A  subset  selection  problem 
employing  a  new  criterion.  Statistical  Decision  Theory  and  Related 
Topics  -  II  (Eds.  S.  S.  Gupta  and  D.  S.  Moore),  Academic  Press,  New 
York,  pp.  93-119. 

[3]  Dunnett,  C.  W. ,  and  Sobel ,  M.  (1955).  Approximations  to  the  probabil¬ 
ity  integral  and  certain  percentage  points  of  a  multivariate  analogue 
of  Students'  t-distribution.  Biometrika  42,  258-260. 

[4]  Goel ,  P.  K.  and  Rubin,  H.  (1977).  On  selecting  a  subset  containing 
the  best  populations  -  a  Bayesian  approach.  Ann,  Statist.  5,  969-983. 

[5]  Gupta,  S.  S.  and  Hsu,  J.  C.  (1978).  On  the  performance  of  some  sub¬ 
set  selection  procedures.  Commun.  Statist.- Simula.  Computa.  B7, 
561-591. 

[6]  Gupta,  S.  S.  and  Huang,  D.  Y.  (1977).  On  some  r-minimax  subset  selec¬ 
tion  and  multiple  comparison  procedures.  Statistical  Decision  Theory 
and  Related  Topics- I I  (Eds.  S.  S.  Gupta  and  D.  S.  Moore),  Academic 
Press,  New  York,  pp  139-155. 

[7]  Gupta,  S.  S.  and  Huang,  D.  Y.  (1980).  A  note  on  optimal  subset  selec¬ 
tion  procedures.  Ann.  Statist.  8,  1164-1167. 

[8]  Gupta,  S.  S.  and  Huang,  D.  Y.  (1981).  Multiple  Statistical  Decision 
Theory:  Recent  Developments.  Lecture  Notes  in  Statistics,  Vol .  8, 
Springer- Verlag,  New  York. 

[9]  Gupta,  S.  S.,  Huang,  D.  Y. ,  and  Nagel,  K.  (1979).  Locally  optimal  sub 
set  selection  procedures  based  on  ranks.  Optimizing  Methods  in 
Statistics  (Ed.  J.  Rustagi),  Academic  Press,  New  York,  pp  251-262. 

[10]  Gupta,  S.  S.  and  Kim,  W.  C.  (1980).  r-minimax  and  minimax  decision 
rules  for  comparison  of  treatments  with  a  control.  Recent  Develop¬ 
ments  in  Statistical  Inference  and  Data  Analysis  (Ed.  K.  Matusita), 
North-Holland  Publishing  Co.,  Amsterdam,  pp.  55-71 . 

[11]  Gupta,  S.  S.  and  McDonald,  G.  C.  (1970).  On  some  classes  of  selec¬ 
tion  procedures  based  on  ranks.  Nonparametric  Techniques  in  Statis¬ 
tical  Inference  (Ed.  M.  L.  Puri),  Cambridge  University  Press,  London, 
pp.  419-514. 

[12]  Gupta,  S.  S.  and  Panchapakesan,  S.  (1979).  Multiple  Decision  Proce¬ 
dures  :  Theory  and  Methodology  of  Selecting  and  Ranking  Populations, 
John  Wiley  &  Sons,  New  York" 


[13]  McDonald,  G.  C.  (1972).  Some  multiple  comparison  selection  proce¬ 
dures  based  on  ranks.  Sankhya  Ser.  A  34,  53-64. 

[14]  McDonald,  G.  C.  (1973).  The  distribution  of  some  rank  statistics  with 
applications  in  block  design  selection  problems.  Sankhya  Ser.  A  35, 
187-203. 

[15]  Nagel,  K.  (1970).  On  subset  selection  rules  with  certain  optimality 
properties.  Ph.D.  Thesis.  Also  Mimeo.  Ser.  No.  222,  Dept,  of  Statist. 
Purdue  Univ. ,  West  Lafayette,  Indiana. 


SECURITY  CLASSIFICATION  OF  this  PAGE  rWbtn  Dot*  Enlmrod) 


report  documentation  page 


REPORT  NUMBER 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


Mimeograph  Series  #81-23 


4.  TITLE  (mid  Subtltl,) 

Some  Locally  Optiaml  Subset  Selection  Rules 


7.  author^; 

Deng-Yuan  Huang  and  S.  Panchapakesan 


*•  PERFORMING  organization  name  ano  address 

Purdue  University,  Dept,  of  Statistics,  W. Lafayette, 
Indiana 


<1.  CONTROLLING  OFFICE  NAME  AND  AODRESS 


4.  MONITORING  AGENCY  NAME  A  AODRESS (II  dlllorotu  Irom  Conlrolllng  Ollict) 


l«.  DISTRIBUTION  STATEMENT  (ol  0,1,  R.po rt)  ' 

Approved  for  public  release,  distribution  unlimited 


$•  type  of  report  a  period  covered 


••  performing  org.  REPORT  number 

Mimeo.  Series  #81-23 


contract  or  grant  NUMBERCa; 


N00014-75-C-0455 


to.  PROGRAM  ELEMENT.  PROJECT  task 
AREA  4  WORK  UNIT  NUMBERS 


IZ.  REPORT  DATE 

June  1981 


13-  NUMBER  OF  PAGES 

17 


15.  SECURITY  CLASS,  (of  tht*  report ) 

Unclassified 


ISa.  DECLASSIFICATION  DOWNGRADING 
SCHEDULE 


KEY  WORDS  (Contlmim  on  rmvmrmm  mido  II  nocoaamry  And  Identity  by  block  numbor) 


Subset  selection,  best  population,  comparison  with  control,  local  optimality, 
rules  based  on  ranks,  normal,  gamma,  logistic. 


00  ,ET»  1473 


_ UNCLASSIFIED 

SECURITY  CLASSIFICATION  of  This  PAGE  fOTiAn  Dai#  Ematad) 


SECURITY  CLASSIFICATION  or  This  PAGEflftw  O.l.  Entirmd) 


Let  vf.,,...,uk  be  k  =  1  independent  populations  where  has  the 
associated  density  function  f(x,e^)  with  the  unknown  parameter  belonging 
to  an  interval  «  of  the  real  line.  Two  types  of  problems  are  studied: 

(1 )  to  select  from  7^,...,^  those  populations,  if  any,  that  are  better 

.t 

(to  be  suitably  defined)  than  nQ  which  is  the  control  population;  and 

(2)  to  select  from  a  subset  preferably  of  small  size  so  as  to 

contain  the  best  population.  For  both  problems,  some  locally  optimal 
selection  rules  are  derived.  The  optimality  criteria  employed  in  the  two 
problems  are  differnt.  Further,  the  procedure  for  the  second  problem  is 
based  on  ranks  though  the  densities  are  assumed  to  be  kno-n  but  for  the 
values  of  the  parameters.  The  rule  in  the  first  case  is  applied  to  the 
special  cases  of  (i)  normal  means  comparison  with  common  known  variance 
and  unequal  sample  sizes;  (ii)  normal  means  comparison  with  common  un¬ 
known  variance  and  unequal  sample  sizes,  and  (iii)  gamma  scale  parameters 
comparison  with  unequal  shape  parameters.  The  rank  procedure  is  special¬ 
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